Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio (right/left) |
---|---|---|---|---|
Elle | 1288 | 76 | 2 | 38.0000 |
Mais | 1838 | 71 | 2 | 35.5000 |
Cela | 511 | 32 | 1 | 32.0000 |
close | 545 | 30 | 1 | 30.0000 |
Des | 890 | 60 | 2 | 30.0000 |
Son | 368 | 28 | 1 | 28.0000 |
On | 1445 | 83 | 3 | 27.6667 |
Cette | 1316 | 107 | 4 | 26.7500 |
En | 3108 | 178 | 7 | 25.4286 |
Par | 658 | 25 | 1 | 25.0000 |
La | 6247 | 531 | 23 | 23.0870 |
Il | 4548 | 168 | 8 | 21.0000 |
C’est | 818 | 42 | 2 | 21.0000 |
Au | 910 | 62 | 3 | 20.6667 |
Ces | 707 | 41 | 2 | 20.5000 |
Les | 5945 | 523 | 26 | 20.1154 |
Et | 2004 | 74 | 4 | 18.5000 |
Ils | 835 | 36 | 2 | 18.0000 |
Selon | 273 | 16 | 1 | 16.0000 |
Le | 7678 | 589 | 37 | 15.9189 |
Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio (right/left) |
---|---|---|---|---|
Mar | 530 | 1 | 13 | 0.0769 |
décidé | 178 | 1 | 12 | 0.0833 |
ed | 115 | 1 | 11 | 0.0909 |
T | 142 | 1 | 9 | 0.1111 |
ancienne | 90 | 1 | 9 | 0.1111 |
quatrième | 72 | 1 | 8 | 0.1250 |
Recommandation | 76 | 1 | 8 | 0.1250 |
types | 116 | 1 | 8 | 0.1250 |
Content-Length | 558 | 1 | 8 | 0.1250 |
idéal | 118 | 1 | 8 | 0.1250 |
organisation | 79 | 1 | 8 | 0.1250 |
appareil | 71 | 1 | 8 | 0.1250 |
avion | 54 | 1 | 7 | 0.1429 |
tente | 66 | 1 | 7 | 0.1429 |
roues | 49 | 1 | 7 | 0.1429 |
axes | 43 | 1 | 7 | 0.1429 |
aspect | 57 | 1 | 7 | 0.1429 |
grosses | 60 | 1 | 7 | 0.1429 |
degré | 64 | 1 | 7 | 0.1429 |
terres | 53 | 1 | 7 | 0.1429 |
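In this subsection, we compute the ratio of the number of right neighbors to the number of left neighbors of each word. Again, we look for words with extreme ratios. The two tables list the 20 words with the highest ratios and the 20 words with the lowest ratios, respectively.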
SQL query for the data of the first table:
select word,w.freq,aa.cnt, bb.cnt,aa.cnt/bb.cnt as r from words w, (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where w_id=aa.w1_id and aa.w1_id=bb.w2_id order by r desc limit 20;
SQL query for the diagram data:
select aa.cnt, bb.cnt from (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where aa.w1_id=bb.w2_id;
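
As a minimal sketch of how the ratio could be reproduced outside the original database, the following Python script runs an equivalent query against an in-memory SQLite database. The schema (`words(w_id, word, freq)` and `co_n(w1_id, w2_id)`) is inferred from the queries above, and the reading of each `co_n` row as a (left word, right word) pair is an assumption. The toy rows and the helper names `build_toy_db` and `neighbor_ratios` are hypothetical. The factor `1.0 *` is added because SQLite would otherwise perform integer division.

```python
import sqlite3


def build_toy_db():
    """Create an in-memory database with a few made-up rows (not corpus data)."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE words (w_id INTEGER PRIMARY KEY, word TEXT, freq INTEGER);
        CREATE TABLE co_n (w1_id INTEGER, w2_id INTEGER);
    """)
    conn.executemany("INSERT INTO words VALUES (?, ?, ?)", [
        (101, "Elle", 10), (102, "est", 8), (103, "va", 5),
        (104, "dit", 4), (105, "que", 12),
    ])
    # Assumption: each row means "w1_id occurs directly to the left of w2_id".
    conn.executemany("INSERT INTO co_n VALUES (?, ?)", [
        (101, 102), (101, 103), (101, 104), (105, 101),
        (102, 105), (103, 105), (104, 105),
    ])
    return conn


def neighbor_ratios(conn, min_id=100):
    """Return (word, freq, right_cnt, left_cnt, ratio) rows, highest ratio first."""
    query = """
        SELECT w.word, w.freq, aa.cnt, bb.cnt,
               1.0 * aa.cnt / bb.cnt AS r      -- force floating-point division
        FROM words w
        JOIN (SELECT w1_id, COUNT(w2_id) AS cnt   -- right neighbors per word
              FROM co_n WHERE w1_id > :min_id GROUP BY w1_id) aa
          ON w.w_id = aa.w1_id
        JOIN (SELECT w2_id, COUNT(w1_id) AS cnt   -- left neighbors per word
              FROM co_n WHERE w2_id > :min_id GROUP BY w2_id) bb
          ON aa.w1_id = bb.w2_id
        ORDER BY r DESC, w.word
    """
    return conn.execute(query, {"min_id": min_id}).fetchall()


if __name__ == "__main__":
    for row in neighbor_ratios(build_toy_db()):
        print(row)
```

Because both subqueries are joined with an inner join, words that have no right neighbors or no left neighbors drop out of the result, so the ratio is never divided by zero. Sorting the same result in ascending order would presumably yield the words with the lowest ratios, as in the second table.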
5.1.7.1 Number of NN co-occurrences vs. Frequency I
5.1.7.2 Number of NN co-occurrences vs. Frequency II